
Model shapes config #2036


Merged: 14 commits merged into main on Apr 23, 2025

Conversation

jainapurva
Contributor

@jainapurva jainapurva commented Apr 10, 2025

This pull request introduces significant updates to the microbenchmarking framework, focusing on new model types and enhanced shape generation options. The changes expand functionality and enable more extensive benchmarking configurations.

Enhancements to Model Types and Shape Generation

  • Added support for new model types, including ln_linear_<activation> (e.g., sigmoid, relu, gelu) and transformer_block with self-attention and MLP. These are documented in benchmarks/microbenchmarks/README.md.
  • Introduced multiple shape generation options (custom, llama, pow2, pow2_extended, sweep) to support diverse matrix shapes for benchmarking. These options are implemented in benchmark_runner.py and documented in the README.
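To make the shape generation options concrete, here is a small sketch of how a pow2-style option could expand `min_power`/`max_power` into matmul shapes. This is a hypothetical reconstruction, not the actual benchmark_runner.py code; in particular, the assumption that every (m, k, n) combination of powers of two in the range is enumerated is mine.

```python
# Hypothetical sketch of the "pow2" shape option; the real logic lives in
# benchmarks/microbenchmarks/benchmark_runner.py and may differ. Assumes
# every (m, k, n) combination of powers of two in range is enumerated.
from itertools import product


def pow2_shapes(min_power, max_power):
    """Return all (m, k, n) matmul shapes with each dim a power of two."""
    sizes = [2**p for p in range(min_power, max_power + 1)]
    return list(product(sizes, repeat=3))


shapes = pow2_shapes(10, 12)  # dims drawn from 1024, 2048, 4096
print(len(shapes))  # 3 sizes per dim, 3 dims -> 27 shapes
```

The other options (`pow2_extended`, `sweep`) presumably vary the granularity of the generated sizes; see the README for their exact semantics.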

Refactoring and Code Simplification

  • Refactored model creation logic by replacing create_model_and_input with create_model_and_input_data, now imported from torchao.testing.model_architectures. This centralizes model definitions and input data generation.
  • Removed redundant model definitions (ToyLinearModel, LNLinearSigmoid) from utils.py, consolidating them into torchao.testing.model_architectures.

Future TODO: Refactor torchao to use model definitions from torchao.testing.model_architectures (#2078)

Updates to Configuration

  • Expanded benchmark_config.yml to include configurations for new model types and shape generation options, such as llama and pow2.

Documentation Improvements

  • Updated README.md to provide detailed descriptions of new model types and shape generation options, ensuring users can easily understand and utilize the new features.

These changes collectively enhance the flexibility, maintainability, and usability of the benchmarking framework.

Sample configuration.yml for inference benchmarks

benchmark_mode: "inference"
quantization_config_recipe_names:
  - "float8dq"
  - "float8wo"
output_dir: "benchmarks/microbenchmarks/results"
model_params:
  - name: "ln_linear_sigmoid_cuda"
    matrix_shapes:
      - name: "custom"
        shapes: [
          [2048, 4096, 1024],
        ]
    high_precision_dtype: "torch.bfloat16"
    use_torch_compile: true
    torch_compile_mode: "max-autotune"
    device: "cuda"
    model_type: "ln_linear_sigmoid"
    enable_profiler: true

  - name: "bf16_transformer_block"
    matrix_shapes:
      - name: "custom"
        shapes: [
          [2048, 4096, 1024],  # For transformer_block, k is the hidden dimension
        ]
    high_precision_dtype: "torch.bfloat16"
    use_torch_compile: true
    torch_compile_mode: "max-autotune"
    device: "cuda"
    model_type: "transformer_block" # TODO: Add a custom model (Figure out how to do this, maybe pass a .py file with model definition)
    enable_profiler: true

  - name: "large_bf16_ln_linear"
    matrix_shapes:
      - name: "llama"  # Example of using LLaMa shapes
      - name: "pow2"  # Example of using power of 2 shapes
        min_power: 10  # 1024
        max_power: 12  # 4096
      - name: "pow2_extended"  # Example of using extended power of 2 shapes
        min_power: 10  # 1024
        max_power: 11  # 2048
      - name: "sweep"  # Example of using sweep shapes (use with care: generates many shapes)
        min_power: 8   # 256
        max_power: 9   # 512
    high_precision_dtype: "torch.bfloat16"
    use_torch_compile: true
    torch_compile_mode: "max-autotune"
    device: "cuda"
    model_type: "linear"
    enable_profiler: true  # Enable profiling for this model
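The sample config above can be illustrated with a minimal sketch of how a runner might expand the `custom` shape entries. The dict literal mirrors the YAML (parsing, e.g. via PyYAML's `yaml.safe_load`, is omitted to keep the sketch dependency-free), and the traversal logic is my assumption rather than the actual benchmark_runner.py implementation.

```python
# Sketch of iterating a parsed benchmark config; the dict literal stands
# in for the YAML above (a real runner would load the file first). The
# traversal is an assumption, not the actual benchmark_runner.py code.
config = {
    "benchmark_mode": "inference",
    "model_params": [
        {
            "name": "ln_linear_sigmoid_cuda",
            "model_type": "ln_linear_sigmoid",
            "matrix_shapes": [
                {"name": "custom", "shapes": [[2048, 4096, 1024]]},
            ],
        },
    ],
}

runs = []
for params in config["model_params"]:
    for spec in params["matrix_shapes"]:
        if spec["name"] == "custom":
            # Each shape is assumed to be an (m, k, n) triple.
            for m, k, n in spec["shapes"]:
                runs.append((params["name"], params["model_type"], m, k, n))

print(runs)
```

Named options like `llama` or `pow2` would be expanded into concrete shape lists by the runner instead of being read directly from `shapes`.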

@jainapurva jainapurva requested a review from Copilot April 10, 2025 19:18

pytorch-bot bot commented Apr 10, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2036

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (6 Unrelated Failures)

As of commit 8f73ebf with merge base d06b3e3:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 10, 2025
Contributor

@Copilot Copilot AI left a comment


Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.

@jainapurva jainapurva added the topic: performance Use this tag if this PR improves the performance of a feature label Apr 10, 2025
Contributor

@Copilot Copilot AI left a comment


Copilot reviewed 10 out of 10 changed files in this pull request and generated 1 comment.

@jainapurva jainapurva force-pushed the model_shapes_config branch from bcdb20c to bbcba36 Compare April 10, 2025 23:12
@jainapurva jainapurva marked this pull request as ready for review April 14, 2025 19:34
import torch.nn as nn


class ToyLinearModel(torch.nn.Module):
Contributor


If you plan to use these for benchmarks, they should be in torchao rather than testing. I think if someone pip installs torchao, it doesn't install the test dir.

Contributor


Also, we should probably mark a TODO to consolidate all random model architecture definitions into one place. ToyLinear shows up all over the place in our testing code.

Contributor Author


torchao.testing is built into the wheel. As discussed offline, we can define models in torchao.testing and use them everywhere in torchao, tests, and benchmarks.

@HDCharles
Contributor

It looks like there's more here than just the model shapes config; you should add more documentation to the PR description about all the things that are being changed/updated.

Contributor

@HDCharles HDCharles left a comment


Anything expected to be used outside of tests should be in ao. Imports from benchmarks into testing should have a really good justification.

@jainapurva jainapurva changed the base branch from main to bench-gpu-profiling April 15, 2025 19:41
@jainapurva jainapurva changed the base branch from bench-gpu-profiling to main April 18, 2025 17:47
@jainapurva jainapurva requested a review from HDCharles April 18, 2025 18:29
Contributor

@HDCharles HDCharles left a comment


lgtm

@jainapurva jainapurva merged commit a724a37 into main Apr 23, 2025
12 of 18 checks passed
@jerryzh168
Contributor

jerryzh168 commented Apr 23, 2025

Sorry, we'll need to revert this to fix the diff train; we can't touch torchao/quantization/qat/embedding.py or torchao/quantization/qat/linear.py for now.

jerryzh168 added a commit that referenced this pull request Apr 23, 2025
Revert "Model shapes config (#2036)"

This reverts commit a724a37.
@jainapurva
Contributor Author

Sorry, we'll need to revert this to fix the diff train; we can't touch torchao/quantization/qat/embedding.py or torchao/quantization/qat/linear.py for now.

It was an automatic lint fix, as the ruff tests were failing. I can reland without touching those two files.

@jerryzh168
Contributor

@jainapurva Should be fine now; the diff train is restored.

@jainapurva jainapurva mentioned this pull request Apr 23, 2025